### Data
`*_data` folder contains random sampled data. We will release the full dataset after the paper is published. There are three files: `train.json`, `valid.json`, `test.json`  under `*_data`  folder which are used for training, validation, and testing respectively. Each file contains multiple lines. Each line represent an instance. The schema for each instance is listed below:
```

{
    "title":        #   goal of activity
    "method":       #   subgoal of activity
    "steps":        #   list of step text 
    "captions":     #   list of corresponding captions of step
    "target":       #   next step text
    "img":          #   last step image id
    "target_img":   #   next step image id
    "retrieve":     #   20 retrieved historical relevant steps
    "retrieve_neg": #   list of retrieved top-20 most similar steps with respect to the last step. They will serve as retrieve-negatives 
    }
```
The `img` subfolder in the `*_data` folder contains some sample images and the corresponding wikihow task json file for the gardening and crafts datasets.